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Abstract — Random linear network coding is a particularly 
decentralized approach to the multicast problem. Use of random 
network codes introduces a non-zero probability however that 
some sinks will not be able to successfully decode the required 
sources. One of the main theoretical motivations for random 
network codes stems from the lower bound on the probability 
of successful decoding reported by Ho et. al. (2003). This result 
demonstrates that all sinks in a linearly solvable network can 
successfully decode all sources provided that the random code 
field size is large enough. This paper develops a new bound on 
the probability of successful decoding. 

I. Introduction 

It has been recently proved that network layer coding can 
increase throughput, particularly for multicast scenarios [1]. 
It is also known that linear network codes [2] can achieve 
max-flow upper bounds on the throughput in a single source 
multicast network. The algebraic approach of [3] is particularly 
useful in the design and analysis of linear network codes, and 
we adopt the notation and terminology of that paper 

Random networks codes [4], [5] are linear network codes 
in which the encoding coefficients are chosen randomly from 
a finite field. The sink nodes can decode correctly if and 
only if the overall transfer matrix from the sources to each 
sink is invertible. One of the main theoretical results for 
random network codes consists of the following lower bound 
on the probability of successful decoding [4], assuming that 
the underlying network is linearly solvable over (i.e. there 
exists a linear code which satisfies the multicast requirements). 
For a network code in which some of the code coefficients are 
chosen independently and uniformly from a finite field with 
cardinality q, the probability that all d receivers can decode 
the source processes is at least 

where i' is the maximum number of links receiving signals 
with independent random coefficients in any set of links 
constituting a flow solution from all sources to any receiver 
[5]. 

A looser bound (subject to the same conditions as above) 
which depends only on rj, the total number of edges receiving 
signals with independent random coefficients is given by [4], 
[6] 

Thus provided a linear solution over Fg exists in the first 
place, the probability of successful decoding can be made 



as close to one as desired, by increasing the field size q. 
The bounds ([TJ and (|2]i rely on the special structure of the 
determinant polynomial of the transfer matrix of the network. 

This paper develops the following new lower bound. 

Theorem 1: Consider a network code in which -q edges 
receive signals with independent random coefficients chosen 
independently and uniformly from a finite field with cardinal- 
ity q. If there is some choice of coefficients for these r] edges 
that results in a solution over then the probability that all 
receivers can decode the source processes is at least 



1 



(3) 



Our approach for the proof of this theorem is to identify 
a critical sub-matrix of the Edmonds matrix whose non- 
singularity is a necessary and sufficient condition for decoding 
success. This critical matrix is different for each sink in 
the network. The new bound results directly from a nesting 
property of the critical matrices. 

In the new bound, the field size q required to attain a 
given probability of success depends only on the number 
of edges with random coefficients, and not on the number 
of sinks. The resulting d-fold reduction in the required q 
could be significant. We emphasize that (O, like ([T]l applies 
only when the underlying network is solvable over ¥q. This 
is a consequence of the conditions for applicability of the 
Schwartz-Zippel inequality, which is used in the proof of both 
bounds. Thus (O does not imply the universal existence of 
binary solutions for every network. The bounds ([T]i, (|2]) and 
(O only provide lower bounds for a given q when the network 
is solvable over F^. 

We further conjecture that for large random networks satis- 
fying certain properties, the success probability behaves as 



n 



1-1 



(4) 



where E is the total number of links in the network. 

The paper is organized as follows: Section HIl presents our 
model and introduces some algebraic notation. Section |lll] 
develops the new bound ((Sj, while Section[V]discusses random 
graphs, leading to the conjecture (|4]i. 

II. Network Coding Model 

We adopt the model from [3]. The network is represented 
by a directed acyclic graph Q = {V,£) with V = |V| nodes 
and E = \£\ edges. There are r independent, discrete source 



processes with messages belonging to F^, and d > 1 receivers. 
Each receiver node has L > r incoming edges. The muhicast 
requirement is that each receiver node can decode every source 
message from the signals on its incident edges. 

Each edge e G £ is incident to node v E V if v — head(e), 
or is an outgoing edge if v — tail(Z). The in-degree of a node v 
is din (v) and the out-degree is doui (v). The time unit is chosen 
such that the capacity of each link is one bit per unit time 
and edges with larger capacity are modeled as parallel edges. 
Without loss of generality, it can be assumed that each source 
is associated with a source node G V with c?in (sq) = and 
rfout (sq) — I, a — 1,2, . . . ,r . Similarly, each sink node tp 
has din {tp) — r and o?out {tp) = 0, /3 = 1, 2, . . . , d (it is always 
possible to obtain such a graph by introducing auxiliary nodes 
and edges). It will further be assumed that edges are labeled 
ancestrally. 

A scalar linear network code for G is an assignment of 



linear encoding functions /„ : Fg'"'"'' 



f: 



Lemma 1: Let P be a multivariate polynomial of degree 
dv, with the exponent of any individual variable at most d. 
Let each variable be chosen uniformly from Fg. Then if P is 
not identically zero, 

Pr(P^O)> {^--^ ■ (6) 
We make two remarks on this approach. First, application 
of Lemma [U to P as defined in (|5]l implies an independence of 
the events Pp^ — and Pp^ = 0. Depending on the structure 
of the network, these events may be strongly dependent. For 
example, consider Pi = P2 = ■ ■ ■ = Pd, meaning all sinks 
have identical incoming signals (Bi = B2 = ■ ■ ■ ~ Bd). Then 
Lemma [U yields a lower bound (1 — d/qY , rather than (1 — 
1/qY ■ Obviously this is an extreme example, yet it illustrates 
the point that ([T]i may be loose. 

Secondly, the modified Schwartz-Zippel bound itself can be 
very loose, as the following example shows. Let H E ]p»"x™ 
to each node with each entry /ly chosen independently with a uniform 



V E V. Such codes are sufficient for the multicast problem 
on acyclic delay networks. Following [3], define the encoding 
matrix F E ¥^^^ where Fij is the coefficient applied to the 
symbol incoming on edge i E £ for contribution to outgoing 
edge j E £. According to the assumption of ancestral ordering, 
F is strictly upper triangular. Similarly, the source matrix A E 
Fg^^ maps messages onto outgoing source edges and the sink 
matrix Bp G F^^^ maps incoming sink edges onto the sinks 
tpEV, f3= 1,2,..., d. 

Let X E F;^^*" be a row vector representing the source 
messages. Then the received vector of symbols yp E F^^** 
at sink (3 = 1,2, ... ,d is given by 



where 



yp = xMp 



Mp = A{I - py^Bj. 



Each sink can decode all sources if and only if dct(yl(/ — 
F)^^Bp) ^ for every f3 = 1, 2, . . . , d, or equivalently if 
the Edmonds matrix 



Zb = 



A 



is non-singular 

Considering the entries of A, F and Bp as variables, 
the Leibniz determinant formula provides a way of writing 
det Zp as a multivariate polynomial Pp in the a^j , fij , bij . 
Furthermore, this multivariate polynomial has degree at most 
I' but is linear in each variable individually. Therefore the 
product 

P = Y[Pfi (5) 

has degree di', with each variable of degree d or less. 

The lower bound ([U results from a modified Schwartz- 
Zippel bound, which takes into account the individual variable 
degree constraint of Pp [5, Lemma 1]. We reproduce this 
lemma for reference. 



distribution on F^. Then it is well known that 

m 

Pr (det H^0) = n„-,{q) = l[{l q^') 



(7) 



In contrast. Lemma [T] gives the lower bound 

Pr(deti? 7^0) > (l-g^i)"", (8) 

which also could be obtained from (|7]l by lower bounding each 
term in the product by the minimum term (1 — g^^). 

We emphasize that Q applies only when P is not identi- 
cally zero for every choice of variables (e.g. all coefficients 
are zero). This precludes application of Q to non-solvable 
networks, i.e. networks where every choice of F makes Zp 
singular and hence P = 0. 

In Section |III] we partially address the dependency between 
the Pp, while in Section|V]we consider large random networks, 
where we also discuss the extent to which (|7]i improves (O. 

III. The New Bound 

According to our assumption regarding sources and sinks, 
and the ancestral ordering of edges, we can further assume 
without loss of generality that 

A = [/rxr ^ry.(E—r)\ 



Bp ~ [Orxfe^ Ir 



Orx(£;-r-fe^)] ,P— 1,2,. . . ,d 

where ki > r and kp > r + kp-i, /3 > 1. This means 
that the sources inject messages into the network via edges 
1, 2, . . . , r and that each sink observes signals on r consecu- 
tively numbered edges. No sink shares edges with any other 
sink or source. See Figure [T] for an example of how to arrive 
at this formulation. 

Then the Edmonds matrix for sink (3 has the following 
structure: 



Za = 
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(9) 



where the Ui are square, upper triangular with diagonal 
elements all equal to 1. The matrices Ui and U3 are r xr, U2 
is — 2r) X {ki3 — 2r) and C/4 is {E — r — kp) x {E — r — kp). 

Definition 1: The critical matrix for sink /3 is the following 
{kp — r) X (fc^ — r) principal sub-matrix of Zp, 



(10) 



Lemma 2: The determinant of the Edmonds matrix for sink 
/3 has the same magnitude as the determinant of its critical 
matrix. 

\deiZp\ - IdetC^I 
Proof: Straightforward from either the Laplace expansion 
of detZp, or repeated application of the partitioned matrix 
determinant formula. ■ 
We can immediately apply Lemma [T| to det Cp to bound the 
probability for a given sink 



Pr (det Zf3^0) = Pr (dot 7^ 0) > 1 - 



1 



Vl3 



(11) 



where 77^ is the number of columns in Cp with variable terms, 
i.e. the number of edges in the subset {?- + 1, r + 2, . . . , kp} 
receiving signals with random coefficients. 

For the d receiver problem, we have the following very 
useful property of the critical matrices, which is guaranteed 
by their construction. 

Lemma 3 (Nesting of critical matrices): Cp-^ is a principal 
sub-matrix of Cp^ for P2 > Pi- 

Hence each critical matrix Cp has as nested principal sub- 
matrices, all the critical matrices for sinks l,2,...,/3 — 1. 

Proof: [Proof of main result ©J Let Ep, (3 ^ 1,2, . . . , d 
be the event that sink /? can decode. By Lemma |2] 
det Zp ^ ■^=^ det Cp ^ 0. Now the probability that all 
sinks can decode is given by 

Pr m E J - Pr(Ei) Pr(E2 | Ei) . . . Pr(E^ | Ei . . . E^.i) 

(12) 

Now consider Pr(E„ | Ei, . . . , E„,_i) = Pr(det C™ ^ | 
det Ci 7^ 0, . . . , det Cm-i ^ 0) for some 2 < to < /3. By 
Lemma [3] Cm can be partitioned 



V w 



for appropriate choices of [/, V, W. 

Conditioned on detC,„_i / 0, we can use the partitioned 
matrix determinant formula to write 

detCra = det(C„_i) det {W - VC-^\U) , (13) 

which (conditioned on detC„,_i ^ 0) is zero if and only if 

det (w - vc;;^\u) =0. 

Let (p„i be the multivariate polynomial corresponding to 
det C„i, and let am-i be the multivariate polynomial corre- 
sponding to det {W - VC~\U). Then from (O dcg (f>„, = 
deg(f>m-i + degam-i- This relation also holds for the degree 
of any individual variable. From the Leibniz formula and the 



structure of the Edmonds matrix (as explained previously for 
Pp), we also know that the individual degree of any variable 
in (j)rn or (pm-i is zero or one. Hence 

deg am-i= deg - deg 1 , 

and the degree of any individual variable in <j,n-i is at most 
1. Collecting results so far and applying Lemma [T] 

Pr(E™ I El, ... , Em-i) = Pr (det {W - VC;^\U) ^ O) 

Finally, substitution into ( fT2] ) results in a telescoping sum for 
the exponents, dcg(^i+deg02— deg(/)i-|-deg(/)3— deg02 + - • ■ > 
leaving only 



This directly yields (O via dv < r] = dcg 4'd ~ Vd < E . ■ 
Let 

log(l - d/q) 
z{d,q) = -. — 7^ -7-^. 

log(l - 1/q) 

Then Q is tighter than ([T]i whenever 

rj < V z{d, q). 
Furthermore, z{d,q) > d and 

lim z(d, q) = 00 

q — >d 

lim z{d, q) ~ d. 

Roughly speaking, the new bound is tighter for networks with 
E = 0{vd) and sufficiently small q. 

In some instances it may be useful to have a bound which 
depends only on the total number of edges carrying signals 
with random coefficients. Replacing v with ?/ in ([TJ results in 
(|2]i which is looser than since 

{l-d/qf <{l-llqf . 

Note that successful decoding at a particular sink (3 in 
general depends on only part of Cp. There can be a much 
smaller sub-matrix that determines singularity, for example, 
Cp might be block diagonal, with successful decoding of sink 
j3 depending only on one of the blocks (this case arises when 
there are disjoint paths from the sources to each sink). Thus 
Cp may be larger than strictly required for analysis of sink [3 
alone, however defining the critical matrix this way yields the 
nesting property that results in the new bound. 



IV. Example: The Butterfly Network 

Figure [T] shows the well-known butterfly network, with 
additional nodes and edges introduced in order to satisfy our 
assumptions on sources and sinks. The source s has r = 2 
messages, and the edge labels indicate the edge ordering. 
Edges 1 and 2 carry the two messages from the source, while 
edges 12 resp. 13 duplicate the signals on edges 5 resp. 10, 
and edges 14 resp. 15 duplicate 8 resp. 11. Supposing that 
all other edges carry random linear combinations of signals, 
ly — 7 and ?/ — 9. 



s 




h t2 



Fig. L The butterfly network. 

Figure |2] shows the structure of the Edmonds matrix Zi, 
and the nested critical matrices Ci and C2. To see how the 
nesting arises, B2 has been placed alongside. For clarity, most 
of the zeros have been omitted from each matrix. The solid 
disks represent random entries of F. 
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Fig. 2. Critical matrices for the butterfly network. 

Figure [3] shows the empirically measured probability of 



decoding success versus the field size q for the network of 
Figure [U (filled circles). This was achieved using monte-carlo 
simulation, selecting each of the coefficients uniformly from 
¥q. Results for the first ten prime fields are shown. Also shown 
are the existing bounds ([T]), dashed line, (O, solid line, and the 
new bound dot-dashed line. In this case, the new bound 
is considerably tighter. 
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Fig. 3. Success probability p versus field size q compared to bounds {T), 
j2) and j3) for the butterfly network. 

V. Random Graphs 

Successful decoding for a particular sink jS depends on the 
non-singularity of its critical matrix Cp. To obtain (O we used 
Lemma [T] to bound the probability that this matrix is non- 
singular. It is interesting to consider however circumstances 
under which (|7]i might be applicable, providing an even tighter 
bound. 

There are two main obstacles to the application of (|7|l for 
determination of the probability that dot Cp ^ 0. Firstly, 
(|7]i applies to "full" matrices, with each element chosen 
independently and uniformly from Fg. In contrast, Cp is of 
the form ( fTOb . with all elements below the r-th diagonal equal 
to zero (the strictly lower triangular part of C/2). Secondly, the 
non-zero elements in the upper portion (upper triangular part 
of U2 and all of Wu, W\i and W21) of Cp are determined by 
the topology of the network itself. For a sparsely connected 
network, the proportion of zeros in this part of the matrix will 
greatly exceed Xjq. 

Assuming that the random network code coefficients are 
chosen from the non-zero elements of F^, the total number of 
non-zero elements in F is 

(T = ^ din iy) dout (-y) < E^- 

Let p = a/E'^ be the proportion of non-zero elements. 
Ignoring the structure required by dTol) . generate a random 
m X m matrix C**^'") with elements identically distributed 
according to 

Pr [dj = t) = \ „ 

J) |_p_ 



It is a remarkable fact that provided p does not tend to zero 
or one too quickly with m, 



lim Pr 

m — >oo 



See [7] for a discussion of this threshold effect. Conditioned 
on the event that C'™' has no all-zero rows or columns (if it 
did, the network flow would anyway be infeasible regardless 
of choice of code), the requirement is 



p > — f - log m + log log m 
m \ 2 



This result even holds for independent, but non-identically 
distributed entries, as discussed by Cooper [7]. 

Now for sufficiently small p, C'^™-^ can be permuted with 
high probability into the form ( fTOl l. This leads us to conjecture 
that there exist conditions on a such that 7r„i (q) is the success 
probability for a large, randomly generated network with 
a given degree distribution. The remainder of this section 
analyzes some properties of iTmiq), and demonstrates the 
improvement that may be obtained compared to (O. 

To guarantee a particular probability p using ([H]), the field 
size q must satisfy 
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l/rn 



771 log 
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Hence the required field size increases linearly with the size 
of the matrix. 

Let -Kooiq) = limm^oo 7rm(g) then 



<Z-)=<ZV24 



1/3 



where di is the Jacobi theta function [8, Equation 8.181.3] 
and 



d 



z=0 



i=0 



Truncating the latter series gives the following lower bound. 



TTociq) > 1 



1/3 



This lower bound is compared to tTqc for the first 20 primes 
in Figure |4] For a given probability p in (|7]i, the required field 

size q for m ^ oo satisfies 
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Fig. 4. Lower bound (solid line) and TToo{q) (dots). 



VI. Concluding remarks 

Random network coding is a promising decentralized ap- 
proach for multicast. One of the main implementation con- 
siderations is the size of the finite field required to achieve a 
specified probability that every sink can decode every source. 
This paper presented a new bound on the success probability, 
which in certain circumstances is tighter that the previous 
bound. We also presented a heuristic argument that motivates 
the investigation of tighter bounds for large random networks, 
based on the distribution of rank of large random finite field 
matrices. 

Acknowledgments 

This work was performed while A. Tauste Campo was 
visiting the Institute for Telecommunications Research. This 
work was supported by the Australian Government under grant 
DP0557310, and by the Defence Science and Technology 
Organisation under contracts 4500485167 and 4500550654. 
The authors would like to thank Ian GriveU and Terence Chan 
and for helpful discussions. 

References 

[1] R. Ahlswede, N. Cai, S.-Y. R. Li, and R. W. Yeung, "Network information 
flow," IEEE Tram. Inform. Theory, vol. 46, no. 4, pp. 1204-1216, July 
2000. 

[2] S.-Y. R. Li, R. Yeung, and N. Cai, "Linear network coding," IEEE Trans. 

Inform. Theory, vol. 49, no. 2, pp. 371-381, Feb. 2003. 
[3] R. Koetter and M. Medard, "An algebraic approach to network coding," 

IEEE/ACM Tram. Networking, vol. 11, no. 5, pp. 782-795, Oct. 2003. 
[4] T. Ho, M. Medard, J. Shi, M. Effros, and D. R. Karger, "On randomized 

network coding," in 4 1st Annual AUerton Conference on Communication, 

Control and Computing, Monticello, USA, 2003. 
[5] T. Ho, R. Koetter, M. Medard, D. R. Karger, and M. Effros, "The benefits 

of coding over routing in a randomized setting," in IEEE Int. Symp. 

Inform. Theory, Yokohama, Japan, 2003. 
[6] T. Ho, M. Medard, R. Koetter, D. R. Karger, and M. Effros, "Toward 

a random operation of networks," IEEE Trans. Inform. Theory, 2004, 

submitted. 

[7] C. Cooper, "On the distribution of rank of a random matrix over a finite 
field," Random Struct. Algorithms, vol. 17, pp. 197-212, 2000. 

[8] I. S. Gradshteyn and I. M. Ryzhik, Table of Integrals, Series and Products, 
5th ed. London: Academic Press, 1994. 



which does not depend on 777. 



